CONSTRAINED VARIATIONAL CALCULUS FOR HIGHER ORDER 

CLASSICAL FIELD THEORIES 

CEDRIC M. CAMPOS, MANUEL DE LEON, AND DAVID MARTIN DE DIEGO 

Abstract. We develop an intrinsic geometrical setting for higher order constrained field theories. 
As a main tool we use an appropriate generalization of the classical Skinner- Rusk formalism. Some 
examples of application are studied, in particular, applications to the geometrical description of 
optimal control theory for partial differential equations. 

(N 

C/3 ! 1. Introduction 



OS 



Classical field theories can be intrinsically described in terms of the geometry of a fiber bundle 
it : E — >■ M and its associated higher order jet bundles, J k n ( </% for the case of first order field 
theories). The jet bundles give a geometrical description for higher order partial derivatives of 
r- 1 the fiber coordinates of E with respect to those of M. 

As in almost every physical theory, the dynamics of a classical field theory is completely deter- 
mined by a Lagrangian function, that is, a function on the corresponding jet bundle, L : J k n — > WL. 
For a theory of order k, the solutions are sections of the fiber bundle it : E — >■ M such that 
£vi | they extremize the functional 

(N 

in 



o 
o 
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X 



S{4>) = / L(j k <J>) V , 
Jr 
where rj is a fixed volume form (it is assumed that M is orientable), R C M is a compact set and 
j k (p is the fc-jet prolongation of </>. 

The most basic result on variational calculus is the construction from the above functional of a 
set of partial differential equations, the Euler-Lagrange equations, which must be satisfied by any 
smooth extremal. More interesting, the property of extremizing the problem does not depend 
on the particular chosen coordinate system (fact noted by J. L. Lagrange during his studies of 
analytical mechanics), therefore it must be able to write the Euler-Lagrange equations in an 
jj] ■ intrinsic way. 

In this sense and restricting ourselves to first order field theories, when L : J l ix — >■ M, the 
fundamental object is the so-called Poincare-Cartan form Ql which is an (m+ l)-form (dimM = 
m) univocally associated to the Lagrangian. This form is constructed using the geometry of 
the jet bundle and it is also related with the variational background [20j. Using this form, it is 
possible to write down the Euler-Lagrange equations in an intrinsic way. Indeed, satisfies the 
Euler-Lagrange equations (that is, it is a critical point of the action S) if and only if 

(J 1( PT(^v^l) — 0, for all tangent vector V in TJ lr n . 

Moreover, this form plays an important role in the connection between symmetries and conser- 
vation laws (see 125 1). 
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But many of the Lagrangians which appear in field theories are of higher order (as for instance 
in elasticity or gravitation), therefore it is interesting to find a fully geometric setting also for these 
field theories. Besides in first order field theories it is possible to define a Cartan (m + l)-form, one 
of the main difficulties in higher order field theories is that the uniqueness of the Cartan (m+1)- 
form is not guaranteed (contrary to the first order case). In other words, there will be different 
Cartan forms which carry out the same function in order to define an intrinsic formulation of 
Euler-Lagrange equations. The main reason of this problem, is the commutativity of repeated 
partial differentiation. In the literature, we find different approaches to fix the Cartan form for 
higher order field theories. For instance the approach by Arens [5] which consists of injecting 
the higher-order Lagrangian to a first-order one in an appropriate space by the introduction into 
the theory of a great number of variables and Lagrange multipliers. Following this perspective, 
and from a more geometrical point of view, the reader is refereed to the papers by Aldaya and 
Azcarraga J3JH]. A different approach is adopted by Garcia and Munoz whom described a method 
of constructing global Poincare-Cartan forms in the higher order calculus of variations in fibered 
spaces by means of linear connections (see |T7t ITS]), in particular they show that the Cartan 
forms depend on the choice of two connections, a linear connection on the base M and a linear 
connection on the vertical bundle Vtt. In 1990, Crampin and Saunders [32J proposed the use of 
an operator analogous to the almost tangent structure canonically defined on the tangent bundle 
of a given configuration manifold M for the construction of global Poincare-Cartan forms. In j6] 
we have found an intrinsic formalism for higher order field theories avoiding the problem of the 
degree of arbitrariness in the definition of Cartan forms for a given lagrangian function. We have 
proposed a differential geometric setting for the equations of motion derived from a higher order 
Lagrangian function, strongly based on the so-called Skinner and Rusk formalism for mechanics 
(see also the paper by Vitagliano [35J). 

In this paper, we introduce constraints in the picture. The constraints are geometrically defined 
as a submanifold C of J k n and a function L : J k n — > WL. In other words, we impose the constraints 
on the space of sections where the action is defined. We will show that the formalism introduced 
in our previous paper [6] is adapted to the case of constrained field theories, deriving an intrin- 
sic framework of the constrained Euler-Lagrange equations. For the geometrical description, we 
start with the fibration itw,m '■ W —* M, where W is the fibered product J 7T Xjk-i^ A™(J fc ^ 1 7r), 
and we induce a submanifold Wq of W using the constraints given by C. Using that W has 
a naturally (pre-)multisymplectic form (by pulling back the canonical multisymplectic form on 
A™(J h ~ l Tc)), we deduce an intrinsic and unique expression for a Cartan type equation for the 
Euler-Lagrange equations for constrained higher-order field theories. Additionally, we obtain a 
resultant constraint algorithm and we give conditions to ensure that the final constraint subman- 
ifold is multisymplectic. Finally, we discuss some examples to illustrate the theory. 



2. Notation 

Throughout the paper, we will stick to the following notation. It will be reminded when 
necessary but, for the ease of the reader, we put it here together. 

Lower case Latin (resp. Greek) letters will usually denote indexes that range between 1 and 
m (resp. 1 and n). Capital Latin letters will usually denote multi-indexes whose length ranges 
between and k. In particular and if nothing else it is stated, / and J will usually denote multi- 
indexes whose length goes from to k — 1 and to k, respectively; and K (and sometimes R) 
will denote multi-indexes whose length is equal to k. The Einstein notation for repeated indexes 
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and multi-indexes is understood but, for clarity, in some cases the summation for multi-indexes 
will be indicated. 

We denote by (E, tt, M) a fiber bundle whose base space M is a smooth manifold of dimension 
m, and whose fibers have dimension n, thus E is (m + n)-dimensional. Adapted coordinate 
systems on E will be of the form (x\ u a ), where (x l ) is a local coordinate system on M and (u a ) 
denotes fiber coordinates. The local volume form associated to (x 1 ) is d m x = dx 1 A ■ ■ • A dx m . 
We write d m ~ 1 Xi for the contraction ia/dx i d m x. Note that, for a fixed index % G {1, . . . ,m}, we 
have that dx 1 A d m ~ 1 x i = d m x. 

In section £jH an d all later sections, M shall be supposed oriented and a volume form 77 will be 
provided. Local coordinates (x 1 ) on M will then be considered subject to compatibility with 77, 
that is, such that d m x = 77. Any m-form along a fiber projection over M will be denoted with a 
curly letter. The same letter in a straight font shall denote its factor with respect to either, the 
volume form 77 or the local one d m x, e.g. C = Ld m x, H = Hd m x, etc. 

Finally, pullbacks of forms and functions by fiber projections will still be denoted by the same 
symbol. For instance, 71*77 wm still be denoted 77 on E. 

3. Jet Bundles 

We fix a fiber bundle it : E — > M with dimM = m and dimE = m + n, and where neither M 
nor E are necessarily orientable. 

Let <j),if) : M — > E be two smooth local sections of tt around a given point x G M, We say 
that and ip are k-equivalent at x (with k > 0) if the sections and all their partial derivatives 
until order k coincide at x G M, that is, if 

4>{x) = ip(x) and 



dx 11 ■ ■ ■ dx %k 



dx %1 ■ ■ ■ dx %k 



for all 1 < a < n, 1 < ij < m, 1 < j < k. Note that this is independent of the chosen 
coordinate system (adapted or not) and, therefore, to be /^-equivalent is an equivalence relation 
(see [261 E3 ETJ, for more details). 

Definition 3.1. Let x G M, given a smooth local section <fi G T x ir, the equivalence class of 
^-equivalent smooth local sections (with k > 0) around x that contains is called the kth jet of 
cf) at x and is denoted j!^(j). The kth jet manifold of tt, denoted J k n, is the whole collection of kth 
jets of arbitrary local sections of tt, that is, 

(3.1) J k K = {j k x 4> : xeM, <f>er x 7r}. 

Given a (local) section of tt, its kth lift is (j h <j))(x) = j^ A 



IX* 



Adapted coordinates (x\ u a ) on the total space E induce coordinates (x\ uj) (with < \I\ < k) 
on the A;-jet manifold J fc TT given by: 



dx 1 
from where we deduce that J fc TT is actually a smooth manifold of dimension 



i- rk sr^ ( m ~ 1 + A 

dim J tt = m + n ■ > 

^-^ V 777 - 1 / 
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It is readily seen that ( J k ir, 71^, M) is a fiber bundle, where 

ftkUxfi) = x (in coordinates TCk(x\u^) = (x 1 )). 

It is also clear that the fe-jet manifold J k n fibers over the lower order /-jet manifolds J lr n (see 
Diagram [1]), with < I < k, where by definition J°n = E and where the projections are given 
by: 

Kk t l(j%<f>) = j l x 4> ( in coordinates 7c k j(x l , uj) 



(x\uT 



1 UJ)-, 



with < |/| < fc,0 < \J\ </). 



J k 7T 




Figure 1. Chain of jets 

Definition 3.2. The kth dual jet bundle, denoted J k ^ , is the subbundle of A m J fc_1 7r of 7ik-i- 
semi-basic m-forms over J fe ~ 1 7r, that is, 

(3.2) jV = A™J k -\ = {w G K n J k - l -K : i m i V2 u = 0, v ± ,v 2 G V7r fe _i} , 

where Vftk-i is the vertical bundle of itk-i '■ J k ~ lr x — > M. 

Remark 3.3. The original definition of the fcth dual jet bundle is as the extended dual affine bundle 
of the iterated jet bundle (J 1 7rfc_i, (7^-1)1,0, J k ~ 1 ii). Observe that J k TC is naturally embedded as 
an affine subbundle of J 1 7Ta ! -i. 

Locally, the elements of J k ir^ are of the form 

ptTx+ptJdufA d m - l x % , 

where < \I\ < k — 1. Thus, adapted coordinates (x l ,u a ) on E induce coordinates (x l ,uf,p,p^) 
on J k 7c^. Note that (x l , uf) are adapted coordinates on J k ~ 1 n and that (p,p^) are fiber coordi- 
nates. 

While the kth jet bundle J k n projects over the lower order jet bundles (Diagram [1]), the feth 
dual jet bundle is "embedded" into the (k + l)th dual jet bundle by means of the pullback of the 
affine projection Ttk+i,k (Diagram [2]). 



'k + l,k 



Jfc+V +^> jV V-^>. ■ ■ — ^ JV ^^ ji^t 



ffc+i.fc , 7r fc,fc-r a 
J fc+1 7T »- J fc 7T 



^3,2 _ ""2,1 "^ 7T1 

»- J 2 vr JV E *- M 



Figure 2. Chain of dual jets 
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The dual jet bundle J k ^ has a canonical multi-symplectic structure (see [71 [10]) and its 
elements are naturally paired with those of J k 7c. We recall that a form Q is multi-symplectic if 
it is closed and if its contraction with a single tangent vector is injective, that is, iyQ = if and 
only if V = 0. 

Definition 3.4. The tautological m-form on J k n* is the form given by 

(3.3) e w (K, ...,V m ) = (r^uWu ...,V m ),ue JV, V u ...,V m e T^ k -n\ 

where Tjk^\ is the natural projection from TJ k ii^ to J k/ K*. The canonical multi-symplectic (m + 1)- 
form on J k 7T' is 

(3.4) fi = -d8. 

Definition 3.5. The natural pairing between J fe 7r and its dual J k ^ is the fibered map $ : 
J fc 7r Xjfe-i^ J fe 7T 1 ' -» A m M given by 

(3-5) $(j x V,u;) = (/-V) J V 1 ^. 

Let (x l , uj , u^-) and (x l ,uf,p,pj), where |/| = 0, . . . , k — 1 and |i^| = fe, denote adapted 
coordinates on J k ir and J fe 7r^, respectively. Then, the tautological form and the canonical one 
are locally written 

(3.6) = p d m x + p T J du? A d m - l Xi and Q = - dp A d m x - dp J J A dtt? A d™" 1 ^, 
and the fibered pairing between the elements of J k 7c and J fc 7rt is locally written 

(3.7) 9(x*,u%u%,p,pS) = (p + pi l W} +li )d m x. 

With the aim of clarifying the latter sections and to make this work self content, we continue 
presenting some classical definitions besides of some basic properties, which may be found in 
[2311301 ETJ. 

Definition 3.6. Let / : E — y F be a morphism between two fiber bundles (E, 71", M) and 
(F,p,N), such that the induced function on the base, / : M — > N, is a diffeomorphism. The 
kth prolongation of f is the map j k f : J k n — y J k p given by 

(3.8) U h f )&</>) ■= i/ w ^/. Vj.V e J k n, 

where <pf := f o o / _1 . 

Note that the fcth prolongation j fc / of a morphism / is both, a morphism between ( J k ir, nk,o, E) 
and (J fc p, pk,o,F), and a morphism between (J fc 7r,7Tfc,M) and (J fc p, Pk,N). In each case, the 
induced functions between the base spaces are / and /, respectively. 

Definition 3.7. The (coordinate) total derivatives are the vector fields along i^kk-i locally given 
by 



(3-9) j„, : = a _i + U I+li a,. a a „i + u i rj„.rv + M 7'i a-. a + • • • 



d d n d d n d n d 

dx l dx i I+U du« dx i i du a ^du] 

for coordinates (x\Uj) on J k n. 
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J k <P E 



F J k i 



FIGURE 3. The kth prolongation of a morphism 

Let / G C oc (J k n), then -r4 *= C°°(J fe+1 7r) on the corresponding local chart. In general and 

as the coordinate total derivatives commute, -r-f- G C°°(J fc+ ' ,7 '7r) for any multi-index J G N m . 
Observe also that total derivatives are closely related to partial derivatives by the relation 

d |J| /_, fc+ UI, 0I j I(/ojV) 



dx J 



(3.10) tF°^ W < 

for any section <f) eTti. 

Definition 3.8. A contact 1-form on J k n is a 1-form 9 G A 1 J k ir which is pulled back to the zero 
form on M by the fcth-lift of any section of n, that is, 

(3.11) (j k (/>y9 = 0, V0Grvr. 

The set C k of all the contact forms is called the Cartan distribution (of order k). 

Proposition 3.9. Let (x 1 ,^) be adapted coordinates on J k n, a basis of the Cartan distribution 
is given by the contact forms 

(3.12) 9f = duj - u? +1 . dx\ < \I\ < k - 1. 

Definition 3.10. Given a vector field £ on E, its kth-lift (or kth-jet) is the unique vector field 
£( fe ) on J fc 7T that is projectable to £ by 7^0 and preserves the Cartan distribution with respect to 
the Lie derivative. 

Proposition 3.11. Let £ be a vector field on E. If £ has the local expression 
(3.13) 



dx l ' du c 



in adapted coordinates (x l ,u a ) on E, then its kth-lift £^ has the form 
for the induced coordinates (x l ,Uj) on J k n, where 



(3.15) 



C = T and ff +u 



u, 



dx { I+lj dx* ' 
In particular, if £ is vertical with respect to n, then £j = d' J '£ a / dx J . 
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Proposition 3.12. Let ip e be the flow of a given n-projectable vector field £ over E. Then, the 
flow of £( k > is the kth-lift ofip £ , j k ip £ . 

4. VARIATIONAL CALCULUS 

The dynamics in classical field theory is specified giving a Lagrangian density: A Lagrangian 
density is a mapping C : J k ir — > A m M. Fixed a volume form r\ on M, there is a smooth function 
L : J k n — y R such that C = Lrj. 

Definition 4.1. Given a Lagrangian density C : J k n — > A m M, the associated integral action is 
the map S '■ F7r x /C — > R given by 

(4.1) S(<P,R)= /(/</>)% 

JR 

where /C is the collection of smooth compact regions of M. 

Definition 4.2. Let be a section of n. A (vertical) variation of is a curve e G / i— > <p £ G Tn 
(for some interval I C R) such that (j) £ = (f £ o<f)o ipj 1 , where ip £ is the flow of a (vertical) 
7r-projectable vector field £ on E. 

When £ is vertical, then its flow (p £ is an automorphism of fiber bundles over the identity for 
each eel, i.e. tp e = Mm- 

Definition 4.3. We say that G Tit is a critical or stationary point of the Lagrangian action S 
if and only if 



(4.2) 



T£ [S(<Pe,R)} 



d 

=0 de 



{j k <t>e)*C 



UR 



0. 



e=0 



for any vertical variation <p £ of <p whose associated vertical field vanishes outside of it 1 (R) and 
where R £ = (p £ (R). 

Lemma 4.4. Let <p £ = !f £ o(poifj 1 be a variation of a section <p G Tir. If!; denotes the infinitesimal 
generator of (f £ , then 



(4.3) 



-[(/(^o0)>] 



e=0 



(iV):(v> w ) 



for any differential form uj G f2(J k ir). 

Proof. From Proposition 13.121 we have that ^ k ' is the infinitesimal generator of j k tp £ . We then 
obtain by a direct computation, 



(/<«:( v>^) = Ww* tz [tivr«] 



de 



, 4[(/W»>] 



e=0 



D 



The following lemma will show to be useful in the variational derivation of the higher-order 
Euler-Lagrange equations. 

Lemma 4.5 (Higher-order integration by parts). Let R C M be a smooth compact region and let 
f,g:R — > R be two smooth functions. Given any multi-index J G N m , we have that 

&\ J \f ,„ , _ lrl r „9i J i ( 



(4.4) 






d m 1 x i 
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where A is given by the expression 

(4-5) X(I f , I g , J) := (-l)W • |J/I | ! J| I , /9|! ■ j^j 



</• V 



rjl Jl n 

Proof. In this proof, we will use the shorthand notation fj = -^p- so as to safe space. 

We proceed by induction on the length I of the multi-index J . The case I = | J\ = is a trivial 
identity and the case I = \J\ = 1 is the well known formula of integration by parts 

f t gd m x = [ fgd™- 1 ^- [ f 9l d m x. 

R JdR JR 

Thus, let us suppose that the result is true for any multi-index J G N m up to length I > 1, in 
order to show that it is also true for any multi- index K G N m of length 1 + 1. Let J and 1 < j < m 
such that J + lj = K. We then have, 



[ f J+lj gd m x = - [ fjg 3 d m x+ [ fjgd m - x x 3 

JR JR JdR, 



fg J+1] d m x+ / fjgd^Xj 

R JdR 

- E Klf>I ai >i) [ f W^ d^xt, 

I f +I gj +U=J Jm 

where we have used the first-order integration formula in first place, to then apply the induction 
hypothesis. We now multiply each member by (J(j) + l)/(l + 1) and sum over J + lj = K. Using 
the multi-index identity (1B.2I) . we have 

j f K gd m x = (-1)< +1 j fg K A m x + £ ^TTT J /j« d ™"S 

77Z- TS l + l r , r , ■, _ 7 JdR 



Jj4 . 



J+lj=K If+Ig +li = J J9R 



It only remain to rearrange properly the last two terms to express them in the stated form. 
Clearly, 



JJ^U"^ 



, , i ] \K\\ Ifl-IJJaR dx*t dxh Q 



If+Ig + li=K 

\lg\=0 
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The last term is a little bit more tricky, 



E^E 



1 + 



I r „ 1 1 . \r 

'_;nlJ«yl+ 1 . l fl ' l 9 * 



J! 



J\'- W'W-JBR 



fi f 9i 3l+ i 3 d m v x % 



__ y ( . lp .6W.JL/ / 

- y: (-i) i/si e ^- j) - |//|! - |/ff|! - ^ ! 

, _ \J-n 



i f 9i aj +iA m 1% i 



I f +Ig + li=K Ig. +1.-I,, '■■'' 



Itfl! it\-U 



I^I>1 



L 9j~ 1 -] — J 9 



/ ; ' V </aR 



///»/, d 



771—1 



, /, ; 



E (-i)"- 1 -^ 



\lg\. I\. 



If+Ig + U = K 

\I a \>l 



K\ 



f If g Ig d m l Xl 
1 f- ' V JdR 



where we have used the identity (1B.2J) again. The result is now clear 



□ 



Theorem 4.6 (The higher-order Euler-Lagrange equations). Given a fiber section (ft G Ttt, let us 
consider an infinitesimal variation (f) £ of it such that the support R of the associated vertical vector 
field £ is contained in a coordinate chart (x l ). We then have that the variation of the Lagrangian 
action S at cf) is given by 



de 



S((pe,Re 



e=0 



(4.6) 



E 

|J|=0 



<-^M r ^ ,d "* 



I s +I L +li=J JdR ^ 



d |7il dL 
5 dx lL du a j 



im-l 



,L ; 



Moreover, cf) is a critical point of the Lagrangian action S if and only if it satisfies the higher-order 
Euler-Lagrange equations 



(4.7) 



cfvr E(-i) 



iji 



|J|=0 



dl J l dL 
dx J du°j 



on the interior of M , plus the boundary conditions 



(4.8) 



d' f l dL 
dx 1 du a T 



0, < |/| < \J\ < k, 



on the boundary DM of M. 
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Proof. Let us denote by £ the vertical field associated to the variation 
Cartan's formula £ = doj + jo d and Proposition I3.11[ we have that 



By Proposition 14. 4\ 



^- £ S(<Pe,R) 



£ = 



R 



CW(%*>£) 



R 



(j fe 0)*d( ¥fc) /:)+ / (j fc 0)V' d£ 



R 



I {j k <f>)\ w C + / (j k m^ k \L) d m x -dLA z eW d m x) 

dR JR 



Jm- (t 

JR \|J|=o 



dl J l£ Q dL 
dx J du°j 



d m x 



If we now apply the higher-order integration by parts (j4.4p and we take into account that Equation 
(13.10p . we obtain that 



d£ 



S(<f>e,R) 



£=0 



|J|=0 



(-D^/ fl (^r(r^^u^ 






dx 7 « dx Ih du" 



iro- 1 



•* y' 



/ 5 +/ L + l, = J 

which is the first statement of our theorem. 

If we now suppose that R is contained in the interior of M, as £ is null outside of R, so it is 
£( fc ) outside of R and, by smoothness, on its boundary OR. Thus, if is a critical point of S, we 
then must have that 






" fl )|_„= /Vvr(r£(-i: 

' Jr y |j|=o 



dx J duj 



for any vertical field £ whose compact support is contained in 7r _1 (i2). We thus infer that shall 
satisfy the higher-order Euler-Lagrange equations (J4.7P on the interior of M. 
Finally, if R has common boundary with M, we then have that 



m ^M =E Ey*W^*K^S 



im — 1 , 



,L i 



o. 



\J\=0 I s +I L +li=J 



As this is true for any vertical field £ whose compact support is contained in n 1 (R), we deduce 
the boundary conditions (14.81) . □ 

Remark 4.7. If in the Definition 14.31 of a critical point, one further requires to the variations to 
be null at the boundary dM of M, in the sense that the associated vector field £ be zero over 
TT~ 1 (dM), then would no longer have the boundary condition (j4.8p . In such a case, the space of 
solutions would be broader and one could impose boundary conditions to them in order to obtain 
a particular one. 

We also note that the theorem remains true if we choose vector variations, non necessarily 
vertical. Although the proof would be much more cumbersome. 
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5. The Skinner-Rusk Formalism 

What follows in this section has already been published in [6]. Here we give a summary of the 
most relevant facts that are proven in there, introducing some new insights and points of view. 
The generalization of the Skinner- Rusk formalism to higher order classical field theories will take 
place in the fibered product 



(5.1; 



W = J fc 7T X 



J k ~ 



J 



7T 



The first order case is covered in [24"| [12] or |11| for time dependent mechanics; see also |33~1 134"] for 
the original treatment by Skinner and Rusk. The projection on the z-th factor will be denoted , pr i 
(with i — 1,2) and the projection as fiber bundle over J k ~ 1 ir will be 7Tw.7*-w = TTfe.fc— i ° W-\ (see 



l W,J k 



DiagramH]). On W, adapted coordinates are of the form (x l , uf, u^^p^p 1 ^)^ where |/| = 0, . . . , k— 1 
and \K\ = k. Note that (x % ,uf) are coordinates on J k ~ l ir and that (u%) and (p,Pa) are fiber 
coordinates on J k ir — > J k ~ 1 ir and J kr K^ — > J k ~ 1 n, respectively. 




J fc 7ft 



FIGURE 4. The Skinner- Rusk framework 

Assume that £ : J k n — > A m M is a Lagrangian density. Together with the pairing $ (Equation 
(I3.5J1 ). we use this Lagrangian C to define the dynamical map % on W (corresponding to the 
Hamiltonian density): 

(5.2) H = $-Copr 1 . 

Consider the canonical multisymplectic (m + l)-form Q on J k ^ (Equation f)3.4p ). whose pull- 
back to W shall be denoted also by Q. We define on W the premultisymplectic (m + l)-form 

(5.3) n n = n + <m. 

In adapted coordinates 

(5.4) H = (pSu^+p-Hx^ulu^)) d m x 



(5.5) 



n 



H 



Ii A„.Ct 



Ji 



dp 1 : a d< a d m -' Xt + p 1 : du% u + u % u dp 1 : - 



du a T J 



A d m x, 



where \I\ = 0, . . . ,k — 1 and \J\ — 0, . . . , k. 

We search for a 7r^A,/-transverse and locally decomposable m-multivector field X on W (see 
Appendix |A|) that is solution of the dynamical equation: 



(5.6) 



i x Vt 



H 



0. 
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Remark 5.1. The approach that we followed in [6J was to solve the equation 
(5.7) i h tt n = (m- l)flu, 

where h : TW — > TW is the horizontal projector associated to a horizontal distribution in 
7Tw,m '■ W — > M (i.e. a connection). In fact, equations ( 15. 6 P and (J5.7P are equivalent. Two 
solutions X and h of the corresponding equation are related in such a way that X generates the 
horizontal distribution to which h is associated. If a solution h of (15. 7p is locally written in the 
following form (compare with the expression ( 15. lip below) 

then its components will be governed by the relations (I5.12p . (I5.13p . (15.14]) . ( I5.15p . ( I5.18P and 

(JSH. 

Remark 5.2. Another formulation of the dynamical equation (J5.6P is 

(5.9) {-l) m t x tt = -dH, 

which is the regular expression for classical mechanics (i.e. whenever M has dimension m — 1). 
The only difference is that, in this latter one, the tangency condition to % = is already included: 
Q posses explicitly dp in its local expression, while Qy, does not. So (15. 6p may not determine any 
condition on the coefficients Cj of Xj, which is equivalent to establish whether X is tangent to 
"H = or not. 

It is shown that solutions X of ( 15. 6 p do not exist on the whole W, cf. (6J |2U H2]. Because of 
that, we need to restrict the equation to the space where solutions do exist, that is to 

(5.10) W 1 = {weW : 3X G A%T W W such that A m T w 7r WyM (X) ^ and i x ^n(w) = 0} . 

Thus, a solution X of ( 15. 6 P rather than being a multivector field on W, will be a multivector field 
along W\. 

Remark 5.3. It is important to point out that the sole purpose of the base volume form 7] is to 
ensure the existence of non-vanishing m-multivector fields on M, which implies the existence of 
7Tv^,M-transverse m-multivector fields on W. 

An arbitrary 7ryi/M-transverse multivector X G A™T W W may be written in the form 

(5.11) X = /A^A' = /A-, (J- + A},JL + B-A_ + q|) . 

If we furthermore impose the condition i](X) = 1 (/ = 1) and we compute the local expression 
of the equation ( 15. 6p . we then obtain 

(5.12) A% = v? +u , with |/| = 0, . . . , fc - 1, z = l,..., 

(5-13) = ^--B, 



3 . 



(5.14) ^ £ = B^,with\J\ = l } ...,k-l; 

rj_i — j J 



I+U=J 
^"^ Ji 



81 
(5.15) ^J ^ fl^T> with 1^1 = L 



'A' 
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We shall refer to the equation (j5.12p as the equation of holonomy and to the equations (|5.13|) . 
(j5.14p and ( I5.15P as the equations of dynamics. We will further named the equations of dynamics 
after the order of the multi-index in them, being the bottom, mid and top level ones, respectively. 
We could have written these in the more compact way 

(5.16) E *£ = d^ - B % with I J i = °> • • • > *> 

I+li=J J 

understanding that the first summation term is empty when | J\ — 0, as well as it is the last one 
when \J\ = k (there are no B^ with \K\ = k). 

Notice that the top level equation of dynamics (I5.15P is a constraint on the point w G W where 
the multivector X stands, thus it defines the submanifold W\ of W . The remaining equations, 
equation (j5.12p . (J5.13P and (j5.14p . are restrictions on the coefficients of the multivector X . 
Note also that, for the time being, the A's with greatest order multi-index and the C"s remain 
undetermined, as well as the most part of the S's. 

The existence of a 7Tw,M-transverse and locally decomposable m-multivector X G k™T w W 
(with T](X) = 1) is guaranteed for every point w G W\. However, we cannot assure that such 
multivector X is "tangent" to W\ at w. Here, "tangent" means that the distribution associated to 
X is tangent to W\ at w. Therefore, we require X to be tangent to W\ by imposing the tangency 
condition 

8L \ 
du^J 

Furthermore, we also require X to be tangent to the submanifold of W 

(5.17) W = {w G W : H(w) = 0} = {w G W : p = L - p"u a I+u } 

by imposing the second tangency condition 

x j (p+p i :^ +u -l) = o. 

These two previous conditions are equivalent to the following equations of tangency 

|J|=0 \ l I+U=J J |/|=0 

Therefore, the coefficients of the multivector X are governed by the equations of holonomy ( 15.1 2ft . 
the bottom and mid level equations of dynamics ( 15.13D and (J5.14J) . and the equations of tangency 
(I5TT8|) and (15TT9J) . 

Looking closer to the first equation of tangency ( I5.18p . we may observe that, if the matrix of 
second order partial derivatives of L with respect to the "velocities" of highest order 

(5-20) | J* 




du R du a Kj w=lKl=k 

is non-degenerate, then the highest order A's are completely determined in terms of the highest 
order B's. In the sequel, we will say that the Lagrangian C : J k ii — > A m M is regular if, for any 
system of adapted coordinates, the matrix (j5.20p is non-degenerate. 
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Note also that, thanks to the Lemma [B. 31 and the top level equation of dynamics (j5.15p . the 
terms in the equation of tangency (J5.19P that have A's with a multi-index of length k cancel 
out, while the terms that have A's with a lower multi-index are already determined (equation of 
holonomy ( I5.12p ). So, in some sense, the C's depend only on the B's. 

Theorem 5.4. Consider the submanifold W 2 = Wo D W\ ^-> W and let VL 2 = i*£lu- We have that 
(V^2, O2) is multisymplectic if and only if C is regular. 

Theorem 5.5. Let a G Ttt^m be an integral section of a solution X of the dynamical equation 
(15. 6p . Then, its "Lagrangian part" <j\ = pr ± oa is holonomic, o~\ = j k <ft for some section <f> G Tit, 
which furthermore satisfies the higher-order Euler- Lagrange equations (14. 7p . 

Theorem 5.6. Consider the system of linear equations in B's (of highest order) given by the 
mid level equation of dynamics (I5.14p and the tangency condition (I5.18P . This system has always 
maximal rank but, it is over determined when k = 1 or m = 1, completely determined when 
k = m = 2, and undetermined otherwise. 

6. Variational Calculus with Constraints 

We consider a constraint submanifold i:C4 J kr n of codimension I, which is locally annihi- 
lated by I functionally independent constraint functions \& M , where 1 < \x < I. The constraint 
submanifold C is supposed to fiber over the whole of M. 

Remark 6.1. As our ultimate goal is to find holonomic jet sections that belong to C, one could 
look for a submanifold C of C consisting of the image of such sections. The submanifold C is 
given by the constraint functions of C plus their consequences up to order k, that is, \I/ M , ^t, 

4-tj-, etc. Geometrically, C is obtained as the output of the following recursive process: 

( C, s = 0, r = k; 

(6.1) C^ := <^ n*,o(C'" 1,fc ), s > °> r = °; 

[ J 1 C^ s ' r ~ 1) n n^riC^- 1 '^), s > 0, < r < k; 

which stops when, for some step s > 0, C ( - s+1 ' fc - ) = C^ s ' k \ This algorithm is a generalization to jet 
bundles of the method given in [29J by Mendella et al. to extract the integral part of a differential 
equation in a tangent bundle. The reader is also refereed to the alternative approach by Gasqui 
|19|. 

For instance, if one considers the null divergence restriction u x + v y = in the 2nd-order jet 
manifold of pr 1 : M 3 x M? — > M 3 , then the resulting manifold C^ 2 ' 2 -* = C*- 1 ' 2 -* is given by the 
restrictions u x + v y = 0, u x t + v y t = 0, u xx + v xy = and u xy + v yy = (see Example 18. 2p . 

We now look for extremals of the Lagrangian action ( 14. ip restricted to those sections G Tit 
whose fe-jet takes values in C (see [T5| 28J). We will use the Lagrange multiplier theorem that 
follows. 

Theorem 6.2 (Abraham, Marsden & Ratiu [I]). Let Ai be a smooth manifold, f : M. — > M 
be C r , r > 1, J 7 a Banach space, g : Ai — > T a smooth submersion and M = (7 _1 (0). A point 
<p G N is a critical point of /|jy if and only if there exists A G J 7 *, called a Lagrange multiplier, 
such that cf) is a critical point of f — (A, g). 

In order to apply the Lagrange multiplier theorem, we need to define constraints as the 0-level 
set of some function g. We configure therefore the following setting: choose the smooth manifold 
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.M to be the space of local sections r^7r = {0 : R C M — > E : it o = Id^j}, for some compact 
region R C M. The Banach space J 7 is the set of smooth functions C°°(R,M. 1 ), provided with 
the L 2 -norm, for instance. The constraint function ^ induces a constraint function on the space 
of local sections T r tt by mapping each section to the evaluation of its k-Mt by the constraint, 
that is, 

g : g IVr h- * o /0 e C°°(R, R l ). 

Note that the 0-level set M = g _1 (0) is the set of sections whose k-Mt takes values in the constraint 
manifold C (over R). 

We therefore obtain that a section : M — > E is a critical point of the integral action 5 
restricted to C if and only if there exists a Lagrange multiplier A G (C°°(R, M. 1 ))* such that is a 
critical point of S — (A, g). A priori, we cannot assure that the pairing (X,g(<p)) has an integral 
expression of the type J R A M ^/ M o j k <fi d m x for some functions A M : R — > R. Henceforth, we shall 
suppose that that is the case. 

Remark 6.3. In Theorem 16.21 appears some regularity conditions that exclude the so-called ab- 
normal solutions. In general, given a critical point G M = 5 ,_1 (0) of f\jy , the classical Lagrange 
multiplier theorem claims that there exists a nonzero element (Ao, A) G K x J 7 * such that is a 
critical point of 

(6.2) A /-(A,<7). 

Under the submersivity condition on g, that is is a regular critical point, it is possible to 
guarantee that A 7^ and dividing by Ao in (16. 2p we obtain the characterization of critical points 
given in Theorem 16.21 The critical points with vanishing Lagrange multiplier, that is, Ao = 
are called abnormal critical points. 

In the sequel we will only study the regular critical points, but our developments are easily 
adapted for the case of abnormality (adding the Lagrange multiplier Ao and studying separately 
both cases, Ao = and Ao = 1). 

Proposition 6.4 (Constrained higher-order Euler-Lagrange equations). Let G Ttt be a critical 
point of the Lagrangian action S given in (14. ip restricted to those sections of it whose kth lift take 
values in the constraint submanifold C C J kr n . If the associated Lagrange multiplier A is regular 
enough, then there must exist I smooth functions X^ : R C M — > K that satisfy together with 
the constrained higher-order Euler-Lagrange equations 

(6-3) (J 2fe 0ri >(' l) |7 -T(^r--A /( --) S 0. 




7. CONSTRAINED MECHANICS IN HIGHER ORDER FIELD THEORIES 

As in the previous section, we begin by considering a constraint submanifold i : C ■— > J k n of 
codimension I, which is locally annihilated by I functionally independent constraint functions \1/ M , 
where 1 < /i < /. The constraint submanifold C is supposed to fiber over the whole of M and 
it is not necessarily generated from a previous constraint submanifold by the process shown in 
Remark 16.11 We define in the restricted velocity-momentum space Wq — {w G W : H(w) = 0} 
the constrained velocity-momentum space Wq = pr^ x (C), which is a submanifold of Wo, whose 
induced embedding and whose constraint functions will still be denoted % : Wq ^-> W and \I/ M , 
where 1 < \i < I. The first order case k — 1 is treated in [9]. 
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The following proposition allows us to work in local coordinates on the unconstrained velocity- 
momentum space W. 

Proposition 7.1. Given a point w G W$ , let X G A™(T w Wq) be a decomposable multivector and 
denote its image, i*(X) G A m (T w W), by X. The following statements are equivalent: 

(1) ix^U Y ) = ° f° r ever V Y e T w Wg; 

(2) i x Q G T°W$; 

where T®Wq is the annihilator of i*{T w WQ) in T W W . 

We therefore look for solutions of the constrained dynamical equation 

(7.1) (-l) m i x £l = -A„ dtf " - A dH, 

where X is a tangent multivector field along Wq, the A M 's and A are Lagrange multipliers to be 
determined. Here, the coefficient (— l) m is used for technical purposes. 

Remark 7.2. It should be said that the Lagrange multipliers that appear in the dynamical equation 
( 17. ip have a different nature that the ones that appear in Proposition 16.41 The former are 
locally defined on W, while the latter are locally defined on M. Although they coincide on the 
integral sections o G Ttcw,m of a solution X of the dynamical equation (17.1 p . since its "Lagrangian 
part" oi = pr 1 oa satisfies the constrained Euler-Lagrange equation ( 16 .3p with A M = A M o o (c/. 
Proposition I7.4p . 

Let X G A m (T w W) be a decomposable m-vector at a given point w G W, that is, X = 
X\ A ■ ■ ■ A A m for m tangent vectors Xj G T W W, which have the form 

— (J (J ■ (J (J 

(7 ' 2) X ' = d^ + A<} ^ + B ^dp~S + C % 

in a given adapted chart (x 1 , u 1 } , p 1 ^ , p) ■ A straightforward computation gives us 

(7.3) {-l) m ix{ dp? A d«? A d™- 1 ^) = (AftBg - A?^) dx* + Al d^ - BS du? 
and 

(7.4) (-l) m ^x( dp A d m x) = dp - d dx\ 

Applying the above equations to the dynamical one (17. ip . we obtain the relations 

coefficients in dp : 1 = A; 

coefficients in dp 1 : : A% = XuJ +1 .; 

coefficients in du" B^ = Aj^- — A^f^-; 



R Ii _ \ ( 8L _ V r,jA - \ "" • 

^ai — A I duj l^J+lj=lPa J A V-duJ-> 

fl — \ I dL _ Y^ rvlA - \ gg^ ■ 

U ~~ A I flu" l^J+\,=KPa ) A M fl««) 



/J+l J =ii"^a ; ' v M9ug-' 



coefficients in dx' : AftBg - A?^ + C, = Ag - A M ff . 

Thus, a decomposable m-vector A G A m (T w W) at a point w 6 If is a solution of the dynamical 
equation 

(7.5) (-l) m t x Q = -X fl d^- dH, 
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if for any adapted chart (3c l ,Uj,p^,p), the coefficients of X and the point w satisfy the equations 

(7.6) A%=uf +U , with|/| = 0,...,fc-l, z = l,...; 

BL <9\f 

(7-8) £ P «^-%-* With i J l = 1 '"" fc - 1; 

/+ii=J J J 

( 7 - 9 ) E ^ = ^-W' with ^i = 



I+U=K 



Bu a K ^Bu a K 

BL B^ ^ n I dL . 0*" 




("0) C, = w - K — + ^ „ S+I , A„— - ^ rf - „ ?+1(B5 . 

|j|=o \ J ■> i+u=j / 

Because of the Lagrange multipliers A M , we cannot describe the sub manifold of Wfi where 
solutions X of the constrained dynamical equation ( I7.5P exist, like it has been done in ( I5.15P for 
the unconstrained dynamical equation (15.61) . Therefore, we need to get rid off of them. Consider 
the more concise expression for the equations of dynamics (17. 7ft . ( 17. 8ft and (17. 9p 

BT B^i^ 

(7-ii) £ v 1 : = ^ - a m — - 55, with 1 j\ = o, . . . ,k, 

i+u=j j j 

where, as in (I5.16p . the first summation term is understood to be void when \J\ =0, as well 
as it is the last one when \J\ = k. We now suppose that the constraints ^ are of the type 
u a ~ = <$>°f(x l ,uf), where u°~ are some constrained coordinates which depend on the free coor- 
dinates (x l ,u a j) through the functions $^. Thus, the constraint have the form ty°~(x l ,Uj) = 
u°~ — § a Jx l ,u°~) = 0. So, writing again the previous equation ( 17. lip for the different sets of 
coordinates, the ones that are free and the ones that are not, we obtain 

BL 

(7-12) J2 P% = 7^-^ "5g, with|J|=0,...,fc; 



l+U=J 



Bu a j 



(7-13) £ j# = _ _ Xi^ - B J J, with I J\ = 0, . . . , k. 

i+U=J J J 

Substituting — A^ from (j7.12p into (I7.13p . we have that 

(7-14) £ rf + £ rf L^ = + ^ - B% - B%^ with | J, = 0, . . . ,*. 

I+U=J \I+li=J J J J J J J 

Note that, when \J\ = k, the term B & j disappears, but BJ--^ do not necessarily. This is 

circumvent by supposing that, if \J\ < k, then ■%-£■ = for any \K\ = k. That is the case when 

au K 

the constraint submanifold C has no constraint of higher order, i.e. C = 7r A r^_ 1 (7Tfcjt_ 1 (C)), or, 

more generally, when C fibers by nj, £_i over its image. 
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Taking this into account, we expand the previous equation (|7.14|) . obtaining then constrained 
equations of dynamics freed of the Lagrange multipliers 

(7,5, ' • » " ■ 6LC ■ "'* 



9$" 

E rf J 



du £ 



i+u=J 



(7.16) 
(7.17) 



E^+E^ J 



I+U=J 



I+h=J 



E p?+E p. 



/+i;=A 



/+li=A 
9$3 

- A 



du c 



J 9 ® 6 * 

Ii A 



A 



dL c 

9uJ 

8L C 



BJ-B Jj J -- 

<*3 *3 Q u a ' 



B J J A -B J A^4, with \J\ = 1, 



<-K) 



aj du & 3 



fc-1; 



9m 



9m 



with \K\ 



K 



where |^ = Jj^- + jrjk-Q^k, being C = Co i : C — > A m M the restricted Lagrangian. 

We are now in disposition to define the submanifold W% along to which solutions of the con- 
strained dynamical equation (J73J) exist, 



(7.18) 



Wk 



it: 



^(x\uj) 



{weW$ : (17371) } = < 



w G W : 



p = L{x\u a J )-p^ +1 



£rf+£p£ 

/+1;=A J+li=-K" 



9$ 



A' 



9L C 



q 9m| 



9m 



A 



Tangency conditions on X with respect to W% will give us the constrained equations of tangency 

A c 



(7.19) 
(7.20) 



J j 9a^ 
3L C 



9$ Q f . 9$" 
J + A% ' 



J i duj 



fc-i 



C* 



9^ 



+ E «!U [|-E? 



A 



|J|=0 



Afl,: = J 



fc-1 



(7.21) 



E B 5 



9$% 

I+U=j \K\=k " I+U=K 

9 2 $" 

- E rf ' 



fc-i 



P~ 



E <+^. 






I+U=K 



|J|=0 

d 2 L c 

dxWu^ 



|/|=o 



K 



+ 4 



/+1>=A 

9 2 L C 



dxidu 



A 



Ji 



-EpS 



A A 



^4 / + !. A 



du Su% 

J K , 



E 5 ' 4 



Q J Ai.a 



/+li=A 



9m 



|#| =fc. 



A 



Proposition 7.3. Let Vt^ be the pullback of the premultisymplectic form Q^ to W£ by the natural 
inclusion i : W% '—> W , that is fif = i*(Sl%). Suppose thatm = dimM > 1, then the (m+l)-form 
Q2 is multisymplectic if and only if C is regular along W% , i-e. if and only if the matrix 



(7.22) 



d 2 L c 



E 



du P k du% l __ li h 



Pa 



It K 



dufydu% 



\R\=\K\=k 
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is non- degenerate along W% . 

Proof. First of all, let us make some considerations. By definition, f^ ls multisymplectic whenever 
fig has trivial kernel, that is, 

if v G TW 2 , i v tt c 2 =0 ^^ v = . 
This is equivalent to say that 

if v G i*(TW 2 ), i v ^n\u(Tw 2 ) = ° ^=^ v = . 
Let v G TW be a tangent vector whose coefficients in an adapted basis are given by 

Using the expression (13. 6p . we may compute the contraction of VL-h by v, 

i v Q H = - B*J di# A d m ~ 1 x l + A a j dp 1 * A d m ~ 1 x l - 7 J ' dp 1 * A duj A d m ~ 2 x 

(723) + Mi+iiPa + B a l u'} +U - Aj-^j d m x 

- I 3 (pS du? +li + uf +li dp 1 : - || du?) A d 

In addition to this, let us consider a vector v G TW tangent to W 2 , that is v G ^(TW^), we 
then have that 

(dm f)T c \ 
E^+E^^f-^c (w) = and d ^) = °> 

which leads us to the following relations for the coefficients of v. 

. dm . dm 



Kl 



m— 1, 



Xj. 



dxi du'j 



2 fact 



i d 2 L c ■ 9 $ - 

(7.25) ^ B* = f ( ^-— - ^ P&q-j^K 



I+li=K \ A 7+L=X 



it r,i._i> K 



. /9 2 r c d 2 m \ dm 

' J l du?-du & ^ duldu* I ^- & du% 

UUjUU R I+U=K ua j ua k) I+U=K K 



. fit c dm 

(7.26) C .-. V " £ rf .<^ 



i+ii=i< 



^i(S-Eri'-E^S|-^? + v 






It is important to note that, even though in all the previous equations ( 17.23ft . ( 17.241) . (I7.25D and 
( I7.26P explicitly appear A's with multi-index of length k, for such a vector v G i(T14^), the 
terms associated to these A's cancel out in the development of £„£!#, Equation (I7.23p . and the 
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third tangency relation (j7.26p . Thus, a tangent vector v G z*(7W 2 ) would kill VL-h if and only if 
its coefficients satisfy the following relations 



7 



J = 0, Af = 0, B% = Q, C = 



4 = 4^ ^d 4 -"- - V p?— jPS- = 0. 

K yja R ua R I+U=K uu R uu R j 

These considerations being made, the assertion is now clear. □ 

Proposition 7.4. . Let a G Tttw,m fre an integral section of a solution X of the constrained 
dynamical equation (17. 5p . Then, its "Lagrangian part" o~\ = pr x oo is holonomic, o\ = j k <p for 
some section G IV, which furthermore satisfies the constrained higher- order Euler- Lagrange 
equations fl6.3|) . 

Proof. If X is locally expressed as in ( 15. lip , we know that it must satisfy the equations of 
dynamics (17. 7p . (17. 9p and (17. 9p . for unknown Lagrange multipliers A M . If we note A' = A^o cr and 
LI = L — A' ^, it suffices to follow the demonstration for V of Theorem 15.51 which is proven in 

n- □ 

8. Example 

Here, we study an incompressible fluid under control as in |2j. The corresponding equations 
are the Navier-Stokes one plus the divergence-free condition: 

<9v 

(8.1) — + V v v + Vn = z/Av + f 

at 

(8.2) V-v = 

where the vector field v is the velocity of the fluid, f is the field of exterior forces acting on 
the fluid, which will be our controls, and the scalar functions II and v are the pressure and 
the viscosity, respectively. In particular, our case of interest is the two dimensional case on IR 2 
endowed with the standard metric. If we fix global Cartesian coordinates (x, y) on M. 2 and adapted 
coordinates (x, y, u, v) on its tangent TM 2 = IR 4 , the previous equations become 

(8.3) u t + u ■ u x + v ■ u y + d x Il = v ■ (u xx + u yy ) + F 

(8.4) v t + u • v x + v ■ v y + c^ll = v ■ (v xx + v yy ) + G 

(8.5) u x + v y = 

where, with some abuse of notation, v(t, x, y) = (u, v) and f = (F, G). 

We therefore look for time-dependent vector fields v = (u, v) on R 2 that satisfy the Navier- 
Stokes equations (18. 3p and ( 18. 4p for a prescribed control f = (F, G) and submitted to the free 
divergence condition (18. 5p . Moreover, we look for such vector fields v = (u, v) that are in addition 
optimal in the controls for the integral action 



16) S(v,R) = - J ||f|| 2 dtA dxA dy. 

2 Jr 



In order to apply the development of the present jet bundle framework, all of this is restated 
in the following way: We set a fiber bundle vr : E — > M by putting M = RxR 2 ,E = Rx TM. 2 
and 7r = (pr^pr^a). We fix global adapted coordinates (t,x,y,u,v) on E, which induce the 
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corresponding global adapted coordinates on J k 7t and J*ir . Besides, we choose the volume form 
7] on M to be dt A da; A dy. Thus, the Lagrangian function L : J 2 n — > M. is nothing else but 

L = \{F 2 + G% 

where we obtain F and G as functions on J 2 it using the equations (J8.3P and ( |8.4|) . 

To make the reading easier, we change slightly the coordinate notation of jet bundles to fit 
in this example: The coordinate "velocities" associated to u and v will still be labeled u and v, 
respectively, with symmetric subindexes (as in the original equations); the coordinate "momenta" 
associated to u and v will now be labeled p and q, respectively, with non-symmetric subindexes. 
Finally and as we will focus on the equations of dynamics (I7.15J) . (I7.16P and (|7.17j) . the coefficients 
in the local expression (15. lip of a multivector X associated to the coordinate momenta p and q 
will be labeled B and D, respectively. 

Example 8.1 (The Euler equation). We will first suppose that the fluid is Eulerian, that is, it 
has null viscosity. In this case, the Lagrangian function L = (F 2 + G 2 )/2 associated to the 
integral action ( 18. 6 p is of first order when the "Euler equations", (18. 3p and (J8.4J) with v = 0, 
are taken into account. In J 1 ^, we consider the divergence-free constraint submanifold C = 
{z G J lr K : u x + u y = 0}, which introduces a single Lagrange multiplier A. 

Proceeding with the theoretical machinery, we compute the bottom level equations of dynamics 



correspondin 


g to those of 


(I5.13P 


















= 


u x 


■F + v x 


■G- 


-(B$ 


+ Bl + B*) 








= 


Uy 


■F + Vy 


G- 


~{D\ 


+ Dl + Dl) 


and the top 


level 


equations of dynamics (there are no 


middle ones 


(15.151) 



















corresponding to those of 



p t =F q l 

p x =u- F - X q x 

p y =v-F q y 

We can dispose of the only Lagrange multiplier A by putting 



G 

u-G 

v-G 



A 



p* -q y 



u-F-v-G, 



what defines W% together with the top level equations of dynamics with no Lagrange multiplier. 
From here, we may compute also the constrained Euler-Lagrange equations (16. 3p for this prob- 



lem, which are 



d t F + u ■ d x F + v ■ d y F + v y ■ F - v x ■ G 
d t G + u ■ d x G + v ■ d y G + u x • G — u y ■ F 



d x \ 

8yX 



d_ 

d* ' 



where d„ 

Finally, we note that L is not regular along IV^ since the square matrix, that correspond to 

(E22D, 

/ 1 u v \ 

— v —u ■ v 



1 u 



u 


u 2 + v 2 


u ■ V 


V 


u ■ V 


v 2 





—v 






V 



o 



-u ■ V 







u 



u- 



J 
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has obviously rank 2. Here we have used as u x as independent ("check") coordinate and v y as 
dependent ("hat") coordinate. 

Example 8.2 (The Navier-Stokes equation). Now, we tackle the full problem of the Navier-Stokes 
equations. In this case, the Lagrangian function L = (F 2 + G 2 )/2 is of second order. In J 2 tt, we 
consider the constraint submanifold 

C = [z e J 2 n : u x + u y = 0, u tx + v ty = 0, u xx + v xy = 0, u xy + v yy = 0} 

which comes from the first order constraint (18.21) . free divergence, and its consequences to second 
order (see Remark l6.ip . These constraints introduce for Lagrange multiplier A, X t , X x and X y that 
are associated to them respectively. 

We now proceed like in the previous example by computing the equations of dynamics. In first 
place, we have the bottom level ones corresponding to those of (J5.13P 

= u x -F + v x -G-{Bt + B x x + BD 

= uyF + v y -G-(D$ + D* + Dy y ) 

Note that they are formally the same as before. In second place, the mid level equations corre- 
sponding to those of (I5.14J1 

p t =F- (Bf + B tx + B ty ) q l =G- (Df + D tx + D ty ) 

p*=u-F- (Bf + B x x x + Bf) + A q x = u ■ G - (D'f + D xx + D x y y ) 

p y=vF- (Bf + Bf + Bf) q y = v ■ G - (Df + D\ x + Df) + A 

Note that formally they also coincide with the top level ones of the previous example but for the 
coefficients that now appear in them. And in third place, the top level equations corresponding 
to those of (I5TT5J) 

p tt =0 q tt = 

p** = _ u . F - \ x q xx = - u ■ G 

p yy = - v .f q yy = - v ■ G - \ y 

p tx + p xt = _ ^ q tx + q xt =Q 

pty +p yt =0 q ty + q yt = - \ t 

P xy + p yx =-X y q *y + q y*=-\ x 

We can again get rid easily of the Lagrange multipliers by putting 

p tx + p xt = q l y + q yt p xx + v-F = q xy + q yx p xy + p yx = q yy + v ■ G 

what defines Wf together with the top level equations of dynamics with no Lagrange multiplier. 
From here, we may compute also the constrained Euler-Lagrange equations (16. 3p for this prob- 
lem, which are 

2d 2 tx \ t + d 2 xx \ x + 1d 2 xy \ y -d x \ = d 2 xx vF + 2d x vd x F + vd 2 xx F + 

+d 2 yy u ■ F + 23 y v ■ d y F + v ■ d 2 yy F - 

— d t F — u ■ d x F — v ■ d y F — v y ■ F + v x ■ G 
2d 2 y X t + 2d 2 xy X x + d 2 y X y - d y \ = d 2 xx v ■ G + 2d x v ■ d x G + v ■ d 2 xx G + 

+d 2 yy v ■ G + 2d y v ■ d y G + v ■ d 2 yy G - 

— d t G — u ■ d x G — v ■ d y G — u x ■ G + u y ■ F 
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As before, the Lagrangian is not regular along W$ , what seems to be clear if we observe that L is 
highly non-degenerate: It depends only on 4 of the 12 coordinates of second order. It is worthless 
to show its "Hessian", even though it is interesting to say that it is null only when v is. 

9. CONCLUSIONS AND FUTURE WORK 

In this paper, we have introduced an unambiguous geometric formalism for higher order field 
theories subjected to constraints, with applications to optimal control of partial differential equa- 
tions. Our theory is based on the classical Skinner and Rusk formalism and the theory of higher- 
order jet bundles. In the future, we will do a detailed study of how to derive a constraint algorithm 
derive, in particular we will pay attention on the necessary conditions in order to the algorithm 
do not stop at the primary constraint submanifold; for instance, restricting the dual jet bundle to 
an appropriate submanifold (in agreement with the ideas behind [32j). The design of variational 
integrators for these higher order field equations will be also analyzed (see [28j for the first order 
case). 

Besides, it is worth mentioning now that the BRST construction for field theory has been 
used systematically to deal with the problem of quantizing Gauge Systems (c/. [21)). Gauge 
systems are an instance of constrained field theories where the constraints arise because of the 
Gauge invariance of the system. The BRST construction is based on the introduction of auxiliary 
odd and even degrees of freedom (referred as "ghosts" in physics literature) in such a way that 
the supermanifold structure of the extended formalism allows for a cohomological description of 
the reduced spaces, i.e., of the relevant degrees of freedom from a physical point of view (see for 
instance [22J for a simple description of the reduction procedure in terms of supergeometry) . This 
construction can be put in a nice geometrical setting for first order Lagrangian mechanics (see for 
instance [16]) and we are looking forward to extend such formalism for higher order Lagrangian 
mechanics and Lagrangian field theories. 

Appendix A. Multivectors 

Let P be a n-dimensional different iable manifold. Sections of A m (TP) (with 1 < m < n) 
are called m-multivector fields in P. We will denote by X m (P) the set of m-multivector fields 
in P. Given X G X m (P), for every p G P, there exists an open neighborhood U p C P and 
Xi, . . . ,X r G X{U p ) such that 

X = P J2 f 1 "^** A • • • A X im 

l<ii<...<i m <r 

with f l i- tm g C°°(Up) and m < r < n. A multivector field X G X m (P) is locally decomposable if, 
for every p G P, there exists an open neighborhood U p C P and Xi, . . . , X m G X(U p ) such that 

X = X 1 A...AX m . 

We will denote by 3£™(P) the set of locally decomposable m-multivector fields in P. 

Let D C TP be an m-dimensional distribution. The sections of A m D are locally decomposable 
m-multivector fields in P. A locally decomposable m-multivector field X G £™(P) and an Tri- 
dimensional distribution D C TP are associated whenever A is a section of A m D. If X,X' G 
X™(P) are non-vanishing multivector fields associated with the same distribution D, then there 
exists a non- vanishing function / G C°°(P) such that X' = fX. This fact defines an equivalence 
relation in the set of non-vanishing m-multivector fields in P, whose equivalence classes will 
be denoted by T>(X). There is a bijective correspondence between the set of m-dimensional 
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orientable distributions D in TP and the set of the equivalence classes T>(X) of non-vanishing, 
locally decomposable m-multivector fields X in P. By abuse of notation, T>(X) will also denote 
the m-dimensional orientable distribution D in TP with whom X is associated. 

An m-dimensional submanifold S "-^ P is said to be an integral manifold of X G X m (P) (resp. 
of an m-dimensional distribution D in TP) if X spans A m TS (resp. if TS — D), In such a case, 
X (resp. D) is said to be integrable. Integrable multivector fields shall be locally decomposable. 
A non-vanishing, locally decomposable m-multivector X G £™(P) is involutive if its associated 
distribution T>(X) is involutive, that is, if [D(X),T>(X)] C D(X). If a non-vanishing multivector 
field X G 3£™(P) is involutive, so is every other in its equivalence class T>(X). By Frobenius' 
theorem, a non-vanishing and locally decomposable multivector field is integrable if, and only if, 
it is involutive. 

Now, let it : P — y M be a fiber bundle with dimM = m. A multivector field X G X m (P) 
is said to be n -transverse if A m 7r*(X) does not vanish at any point of M, hence M must be 
orientable. If X G 3L m (P) is integrable, then X is 7r-transverse if, and only if, its integral 
manifolds are sections of tx : P — y M. In this a case, if S is an integral manifold of X, then 
there exists a section G IV such that S = lm(0). 

For more details on multivector fields and their relation with field theories, refer to 



Appendix B. The multi-index notation 
Given a function / : R m — y R, its partial derivatives are classically denoted 



■In 



l2---lk 



\J *Xj iy-\ \J \Xj ijry L/ \Xj n-t 



When smooth functions are considered, their crossed derivatives coincide. Thus, the order in 
which the derivatives are taken is not important, but the number of times with respect to each 
variable. 

Another notation to denote partial derivatives is defined through "symmetric" multi-indexes 
(see |31j). A multi-index / will be an m-tuple of non-negative integers. The i-th component of / 
is denoted I(i). Addition and subtraction of multi- indexes are defined component- wise (whenever 
the result is still a multi-index), (I±J)(i) = I(i)±J(i). The length of/ is the sum |/| = 'Yl li P\i)i 
and its factorial J! = ILj I (i)!. In particular, lj will be the multi-index that is zero everywhere 
except at the i-th component which is equal to 1. 

Keeping in mind the above definition, we shall denote the partial derivatives of a function 
/ : R m — ► R by: 

Q\I\f Ql(l)+I(2)+-+I(m) f 

fl 



OX Finn. F)t„ ' ■ ■ • Finn J: ' 



dx x dx 2 ■ ■ -dx r 

Thus, given a multi-index J, I(i) denotes the number of times the function is differentiated with 
respect to the i-th component. The former notation should not be confused with the latter one. 
For instance, the third order partial derivative Qx d J dx (with / : M 4 — y R) is denoted / 2 32 and 
/(o,2,i,0), respectively. 

Here we present some simple but useful results on multi-indexes. 
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Lemma B.l. Let {aj i } Ii be a family of real numbers indexed by a multi-index I G N m and by 
an integer i such that 1 < i < m. Given an integer k > 1, we have that 



:b.i) E !>.<=£ E «', 

|7|=fe-l i=l |J|=feJ+li=J 



Lemma B.2. Let J G N m fre a non-zero multi-index. We have that 

V ^ Z ^ / +1 

/+L=J ' ' 

Lemma B.3. Let {aj}j be a family of real numbers indexed by a multi-index J G N m . Given a 
positive integer I > 1, we have that 

\j\=i \i\=i-i i=1 

Lemma B.4. Let {aj, 6 J } be a family of real numbers indexed by a multi-index J G N m . Given 
an integer I > 1, we have that 

(b.4) E &J ^ = E E|7r (6r+l4+Q/,<)fl ^' 

\j\=i i-fM-i i=i 

where {Q 1 ' 1 } T ■ is a family of real numbers such that for any multi-index J G N m (with \J\ > 1) 
we have that 

(B.5) E ^q-- = 0. 

i+ii=J ' ' 
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